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I— 5 | Abstract 
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CN ■ We consider the area A = (YaLi Xi(t)) dt of a self-similar fragmentation process 

X = (X(i),t > 0) with negative index. We characterize the law of A by an integro- 
differential equation. The latter may be viewed as the infinitesimal version of a recursive 
distribution equation that arises naturally in this setting. In the case of binary splitting, 
this yields a recursive formula for the entire moments of A which generalizes known results 
for the area of the Brownian excursion. 
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1 Introduction 

The distribution of the area Ae X c = J e s ds of a standard Brownian excursion (e s , < s < 1) 
appears in a variety of settings, including random graphs [15, 16], random trees and branching 
processes [17, 19], order statistics [18], hashing with linear probing [8], not to mention of 
course the study of Brownian motion for its own interest [12, 13] . The entire moments E(A| XC ) 
have a special importance, as they are related, for instance, to asymptotics as n — > oo for the 
number of connected graphs with n labelled vertices and n + k — l edges, see [15] and the survey 
[11]. We refer to Perman and Wellner [14], Janson [11] and references therein for a detailed 
presentation and reviews of known results on this topic. 

The starting point of this work lies in the observation that one can express the area in the 
form 



poo 

a Exc = / \e(t)\dt 

Jo 



where \0(t)\ denotes the Lebesgue measure of the random open set 9(t) ={s£ [0, 1] : e s > t}. 
The point is that the process 9 = (6(t),t > 0) is a self-similar interval-fragmentation in the 



Taboratoire de Probabilites et Modeles Aleatoires, UPMC, 4 Place Jussieu, 75252 Paris Cedex 05; France. 
Email: jean.bertoin@upmc.fr 



1 



sense of [3, 5]. One can derive an integro-differential equation for the distribution of Ae xc from 
the branching and self-similarity properties of 9. In particular this yields recursive formulas for 
the entire moments of Ae X c that have been obtained in the literature by analytic techniques 
based on the Feynman-Kac formula or the analysis of continued fractions and of singularities 
of the generating functions of discrete approximations. 

The same approach applies more generally to self-similar fragmentation processes, a class of 
Feller processes with values in the space of mass-partitions 

oo 

V m — {x — (xi, x 2 , ■■■)'. Xi > x 2 >■■■> and < 1} ■ 

i=i 

Specifically, a self-similar fragmentation X = (X(t), t > 0) fulfills the following two fundamental 
properties. For x G [0, 1], let us denote by F x the law of the version of X started from a 
single fragment of mass x, i.e. X(0) = (x, 0, ...). First, the self-similarity means that there 
exists an index a G 1 such that for every x G (0, 1] the distribution of the rescaled process 
(xX(x a t),t > 0) under P x is P x . Second, the process X satisfies the branching property, 
in the sense that for every mass-partition x = (xi,x 2 , . . .), if X^\X^, . . . are independent 
fragmentations with respective laws F X1 , P X2 , . . ., then the process resulting from the decreasing 
rearrangement of all the fragments of X.^\t), X.( 2 \t), . . . is a version of X(i) started from 
X(0) = x. 

We assume that the index of self-similarity a is negative, which implies that small fragments 
split faster than the large ones. A well-known consequence is that the process of the total 
mass t i — y J2iliXi(t) decreases and reaches in finite time a.s.; in other words the entire 
mass is eventually ground down to dust. This has been observed first by Filippov [7], see also 
Proposition 2(i) in [4]. We may thus define the area 




which is the main object of interest in this work. The denomination area is better understood 
if we remember that a fragmentation admits an interval representation; cf. Section 3.2 in [3]. 
There exists a nested right-continuous family (9(t),t > 0) of open subsets of the unit interval 
such that for every t > 0, the sequence of the lengths of the interval components of 9(t) listed 
in the decreasing order is precisely X(t) = (Xi(t), . . .). If we define a lower semi-continuous 
path F : [0, 1] M+ by 

F(u) = sup{t > : u G 9{t)} , u G [0, 1] , 
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then 6{t) = {u : F(u) > t}, and since = I^WI> we can express A in the form 

A = J Q L F(u)du. Of course F = e is the Brownian excursion in the situation discussed at the 
beginning of this introduction. 

The area A has another natural interpretation in terms of continuous random trees. Indeed, 
Haas and Miermont [9] obtained a representation of self-similar fragmentations in terms of 
some rooted continuous tree T which enjoys a self-similarity and branching properties. More 
precisely, for every t > 0, X(£) = (Xi(t),...) can be viewed as the ranked sequence of the 
masses of connected components of T(t), the subset of the points in T at distance at least t 
from the root. In this setting, 



where |T(t)| denotes the mass of T(t). Hence A represents the average height in T, i.e. the 
average distance of points to the root, where averaging is taken with respect to the mass 
measure of T. We also refer to [19] for results on this quantity in the framework of certain 
discrete random trees. 

In the next section, we will present our main result which determines the law of A as the 
unique solution to an intro-differential equation expressed in terms of the characteristics of X. 
In the case of binary dislocations, this enables us to derive explicit recursive formulas for the 
moments of A. We recover in particular identities due to Takacs [16] for the moments of the 
area of the Brownian excursion. Section 3 is devoted to the proof of this integro-differential 
equation. We shall start by establishing a priori bounds for the moments of A. Then we proceed 
with the simpler case when the dislocation measure is finite, and derive the equation from a 
recursive distribution equation which is naturally induced by the dynamics of fragmentation. 
The general case when the dislocation measure is infinite is then deduced by approximation. 
This relies on a weak limit theorem for the area of fragmentation processes. The proof of the 
latter is somewhat technical and will be postponed to the final subsection. 



We denote by v the dislocation measure of X, so v is a sigma-finite measure on V m with no 
atom at the trivial mass-partition (1, 0, . . .) and fulfills the integral condition 



We implicitely exclude the degenerate case when v = and further assume absence of erosion. 
Roughly speaking, this means that X is a purely discontinuous process that only evolves by 




2 Main results 




(i) 



3 



sudden dislocations whose rates are determined by v and the index of self-similarity a. We 
refer to Chapter 3 of [5] or [3] for background. 

For an arbitrary mass partition x, we denote by 

r] x (da) = P X (A G da) , a>0, 

the law of the area under the probability measure P x for which the fragmentation X starts 
from X(0) = x. For the sake of simplicity, we will work from now on under the law P = Pi, 
i.e. when the fragmentation starts from a single fragment of mass 1, and write then r\ — r\\. 
This induces no loss of generality since, combining self-similarity and the branching property, 
we get that for every mass-partition x = (x±, x 2 , ■ ■ •), there is the identity 

^(da) = P^x l 1 " a AGdaj (2) 

where (-Ai)igN is a family of i.i.d. copies of A. Note that when x has only finitely many non-zero 
terms, then 7] x can be expressed as a convolution product rj x — rj Xl * • • • * rj Xn 

where r\ y stands for the image of rj by the dilation a i— > y 1 ~ a a. Finally, we let (/i, f) — J fdfi 
denote the integral of some function / with respect to a measure /i when the integral makes 
sense. We are now able to state our main result 



Theorem 1 Let X be a self- similar fragmentation with index a < 0, dislocation measure v and 
without erosion. Then for every C 1 function f : M + — > R such that f'(y) = 0(y p ) as y — > oo 
for some p > 0, the law rj of A solves 

fa,/')= / i/(dx)(fa,/)-<ik,/» , (3) 

where the quantities above are finite. Further (3) characterizes rj. 



If we introduce the concave increasing function $ : P + — > R + by 

*(,,) / 1 1 - > z; + «|,..<ix). . I) 



then we immediately see from Theorem 1 that the first moment of A is given simply by 

E(A) = (77, Id> = l/$(-a). 
This identity can also be established directly; see the forthcoming Lemma 1. 



4 



More generally, we shall now derive from Theorem 1 a recursive formula for the entire 
moments of A. For the sake of simplicity, we shall focus on the special case of binary dislocations, 
although more general situations could be dealt with at the price of heavier notation. This 
means that we assume that the dislocation measure v has support in the subset of binary 
mass-partitions {x = (x, 1 — x, 0, . . .),x G [1/2, 1)}. By a slight abuse, we shall then identify 
the dislocation measure v with its image by the map x — >■ x±, i.e. we view v as a measure 
on [1/2,1). Specializing Theorem 1 to f(x) = x k and applying the binomial formula, we 
immediately obtain: 

Corollary 1 Let X be a self- similar fragmentation with index a < 0, binary dislocation mea- 
sure v and without erosion. For every integer k > 0, let M k = K(A k ) denote the k-th moment 
of the area. Then there is the identity 

k-1 

a k M k = fcM fc _i + HkMjMk.j , k > 1 , 

3=1 



with 

at 



[ (1 - x k{1 - a) - (1 - x) k ^ a) )u(dx) = $(jfe(l - a) - 1) , 



and 



r ] [ x^- a \l - x f-^-^uidx) 

\jj J [1/2,1) 



We stress that (1) ensures the finiteness of a k and a^. 

We now discuss some examples, starting with the case of the Brownian fragmentation A = 
A Exc which has motivated this work. The Brownian fragmentation has self-similarity index 
a — — 1/2, no erosion, and its dislocation measure is binary and specified by 

2 

v(dx) = dx , 1/2 < x < 1 ; 

^2^(1 - xf 

see [3] on its pages 339-340. One gets by symmetry 

x Sk/2 _ (1 _ xfk /2^ = ^^(3^ - l)/2) 



-r- 

Jo 



o ^2nx%l -xf ' r(3fc/2-l) 
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and 



fe!r((3j-l)/2)r((3(fc-j)-l)/2) 



V27Tj\(k-j)\T(3k/2-l) 

Following Takacs [16], if we set 

fc_ r((3fc-i)/2) *' 

then after some cancellations, Corollary 1 reduces to 

fc-1 

K k = (3fc/4 - l)K k _^ + K j K *-J 

3=1 

with K = —1/2. This is the recursive equation found by Takacs, which in turn yields a Riccati 
type ODE by considering the exponential generating function of the K k ; see Flajolet et al. [8]. 
We mention the existence of other recursive formulas for the moments of A Exc , see in particular 
[13] and the discussion in [11]. 

Similar calculations apply when more generally the dislocation measure is of beta-type, i.e. 
is binary with 

u(dx) = cx p (l - x) p dx , 1/2 < x < 1 

for some —2 < /3 < — 1. These beta-splitting measures have appeared in works of Aldous [1] 
on cladograms; see also Section 5.1 in [10]. One obtains 



-a k = f (1 - x k(1 ~ a) - (1 - x) k{1 - a) ) X P{\ - xfdx 
c Jo 

= B(/3 + l,/3 + l)-2B(/3 + l + fc(l-a),/3 + l) 

= HM±5) Btf + 2, + 2) - tV + ' + W-^ BV + 1 + *(1 - „), fi + 2) 
where B(a, 6) = Y(a)T(b)/T(a + 6) is the beta function, and 



2 k 
-a>j,k - 
c \J 

V 



j j\^- a \l - xf+^-^-^dx 



B(P + j(l - a) + 1, + (fc - j)(l - «) + 1) 
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Finally, note that we can also deal with linear combinations of the beta dislocation measures, 
which covers for instance the case of Ford's alpha model; see Section 5.2 in [10]. 



3 Proof of Theorem 1 

This section is devoted to the proof of Theorem 1; it relies in four main steps. In the first sub- 
section, we establish a priori bounds for the moments of the area, relying on known properties 
of the so-called tagged fragment. In the second sub-section, we prove the equation (3) in the 
special case when the dislocation measure is finite. In the third sub-section, we provide the 
proof of Theorem 1 by approximation, taking for granted a weak convergence result for the 
area that will be established in the final sub-section. 

3.1 Bounds for the moments of the area 

The purpose of this subsection is to establish some a priori bounds on the moments M k = E(A k ) 
of the area. Recall the notation (4). 

Lemma 1 We have 



As a consequence E(exp(cA)) < oo whenever c < 3>(oo), and in particular the law rj of A is 
determined by its entire moments. 

Proof: It is convenient to work in the setting of interval-fragmentation, i.e. when the frag- 
mentation X describes the ranked sequence of the lengths of the interval components of nested 
open subsets (9(t),t > 0). Recall from the introduction that 9(t) can be expressed in the form 
9(t) = {ii£ [0, 1] : F(u) > t} and note that for every integer k > 1, there is the identity 



Mi = l/$(-a) 



and for k > 1 



M k <k 



k\ 



$(-a) • ■ ■ $(—ka) ' 




In other words, we have 



E(A k ) 



E(F(C/ 1 )-.-F(C/ fe )) 



where Ui, . . . , U k are i.i.d. uniform variables on [0, 1]. This yields 



Mi = E(F(U)) and M k < kE(F(U) k ) 
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where U has the uniform distribution on [0, 1]. 

The variable F(U) should be viewed as the lifetime of the tagged-fragment, i.e. it is the first 
instant t when the size x(t) of the interval component of 6{t) that contains the randomly tagged 
point U reaches the absorbing state 0. This variable has the distribution of an exponential 
functional, 

roc 

F(U) ( =' / = / exp«t)d* 
Jo 

where £ = (£ t ,i > 0) is a subordinator with Laplace exponent <£>; see Corollary 2 in [3]. Since 
it is well-known that 

E(I k ) IA 



(cf. for instance Theorem 2 in [6]), the first two claims are proved, and the last ones follow 
immediately as the function $ increases. □ 



3.2 The case with finite dislocation rates 

In this subsection, we assume that the fragmentation process has a finite dislocation measure, 
i.e. v(V m ) G (0, oo). This means that under P, the process X stays in state (1, 0, . . .) during 
an exponential time T with parameter z/("P m ), and then, independently of the waiting time 
T, jumps at some random mass partition X(T) whose distribution is given by the normalized 
dislocation measure z//z/("P m ). We stress that the jump times of X may nonetheless accumulate 
right after T; in particular then X is not a continuous-time Markov chain. Indeed, the first 
dislocation may produce fragments of arbitrarily small sizes, which then split again almost 
instantaneously by self-similarity. 

The proof of the following weaker version of Theorem 1 in this setting is straightforward. 

Proposition 1 Assume thatX. has no erosion and finite dislocation measure v. Then for every 
C 1 function f : R + — > R such that the derivative f has a finite limit at oo, we have 

( V ,f'}= [ Kdx)((77,/)-(77 x ,/)) . 

Proof: An application of the strong Markov property at the first dislocation time T and (2) 
yields the recursive distributional equation (see the survey [2] for much more this topic) 

oo 

A = T + Y,X % {Tf- a A l 
i=i 

where (Aj) i6 pj is a sequence of i.i.d. copies of A which is further independent of X(T). This 
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entails 



If 



where £(q) = E(exp(— qA)) is Laplace transform of the area. By rearrangement, we arrive at 

-q£(q) = jf i/(dx) ^(g) - n^i" Q ?)j . 

which is the equation in the statement specified for f(a) = e~ qa . This establishes our claim by a 
standard application of the Stone- Weierstrass theorem (recall that that A has finite moments). 
□ 

Let us briefly discuss the elementary example when v = 5(i/2,i/2,o,...)- We thus start with a 
single fragment of unit size which splits in two fragments each of size 1/2 after an exponential 
time with parameter 1, and so on. It should be plain that the area can then be expressed in 
the form 

A = e ,i + 2 a ~ 1 (e lil + e lj2 ) + 2 2(a - 1) (e 2>1 + e 2 , 2 + e 2 , 3 + e 2 , 4 ) + • • • , 

where the e^- for j = 1, . . . , 2* and % = 0, 1, . . . are i.i.d. standard exponential variables. The 
Laplace transform of A is thus given by 



°° / 1 \ 2 " 

^) = n( l + 2 n(a-l)J > 
n=0 x ' 



and the equation 

-q£(q) = £(q) - t(T~\) 2 
provided by Proposition 1 can be checked directly. 



3.3 Proof of Theorem 1 by approximation 

In this subsection, we shall derive Theorem 1 by approximation from the case when the dislo- 
cation measure is finite, taking for granted the weak convergence of the corresponding areas. 
Specifically, we introduce the finite measures 

z/ (n) (dx) = l {1 _ a . 1>1/n} i/(dx) , xeP m , 

where n is a sufficiently large integer so that i» ^ 0. We write A^ for the of a self-similar 
fragmentation process with index a, dislocation measure and without erosion, and denote 
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by rj^ the distribution of A^ n \ Recall (2) and set for a generic mass-partition x 

/ oo N 



j=i 



where (A\ n) : i e N) is a sequence of i.i.d. copies of A*"). The following crucial lemma will be 
established in the next sub-section. 

Lemma 2 The sequence converges weakly to rj as n — >■ oo. 

The next step to the proof of Theorem 1 is the following technical result. 
Lemma 3 (i) Let f : M + — > R be continuous and bounded. Then for every x e P m; 

lim fa£ B \/> = <W>- 

n— >oo 

(ii) Let / : M + 4 R 6e a C 1 function with f'(y) = 0{y p ) as y — )■ oo /or some p > 0, and 
se£ ll/'H = sup{|/'(x)|/(l + x) p : x > 0}. There is a constant c depending only on p and the 
characteristics of the fragmentation such that for every mass-partition x and every n 



\(v {n \f)-(v£\f)\<c\\f'\\(l 



Xi) . 



Proof: (i) Denote by k the cumulant of A, i.e. E(exp(— qAj) = exp(— n(q)) for q > 0, and by 
that of A^ n \ If (A\ n \i G N) is a sequence of i.i.d. copies of A^ n \ then we have 



oo 



E exp -aj]^ 1 -^ = exp 



i=i 



We know from Lemma 2 that lim^oo (x\ a q) = k(x] a q) for every q > and i G N. 
Further, is a concave increasing function with K,^ n \0) = 0, and its derivative at is given 
by E(A^). Recall also from Lemma 1 that E(A^) = l/¥ n \-a) where 

* (n) (?) = jf ^i-f;x;^i/W(dx) = jf i {1 _ xl>1/B} ^i-f;x;^i/(dx). 

Plainly the sequence $("■)(— a) increases and there are thus the bounds 

K (n \x}- a q) < x}- a q/& n \-a) < cx}~ a q. 
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Since x \ a — 1> we conclude by dominated convergence that 

oo oo 
m — yoo * * * * 



n— >oo 1 



By Laplace inversion, this yields our claim. 

(ii) Recall that . . . are i.i.d. copies of A^ n \ and set = X]^ 1 ! 1 '"^ where x = 
(xi, . . .) is a generic mass-partition. We have 

\(v (n \f) - (V ( x\f)\ < + 

< \\f\\E ((1 + 4 n) + S<»>)* ((1 - x\- a )A^ + S<»>)) . 

Recall that Y1T x i — 1- Applying Rosenthal's inequality for the moments of sum of inde- 
pendent nonnegative variables and then Jensen's inequality, we obtain 

E ((1 + 4 n) + S^) 2p ) < c(2p)(l + m 2 ( ; } ) 

and 



E 



((1 - x\- a )A? + S<">) ) < c(2)(l - xx) 2 MW , 



where c(2p) and c(2) are some numerical constants and denotes the k-th moment of A^ n \ 
We conclude from Holder's inequality that 

\(v (n \ f) - v [ x\f)\ < c\\f\\(i - x 1 )y / M 2 (n) (i + < ) ). 

We can complete the proof with an appeal to Lemma 1, recalling that the sequence n >->■ & n \q) 



increases for every q > and thus sup n y (1 + M 2p ) < oo. □ 

We can now proceed to the proof of Theorem 1. 

Proof of Theorem 1: We first suppose that / and /' are bounded. Then (3) follows 
from Proposition 1 and Lemma 3 by dominated convergence. Next, we only assume that 
/ is of class C 1 with f'(y) = 0(y p ) as y — > oo. Then it is easy to construct a sequence 
(/„ : n G N) of functions of class C 1 with /„ and f' n bounded such that f n — > f and f' n — > 
f pointwise and sup n ||/^|| < oo in the sense of Lemma 2. This implies that we also have 
sup„sup x > \f n (x)/(l +x p+1 )\) < oo. 

Since all moments of A are finite, we deduce from Lemmas 2 and 3(i) that 
(V, fn) -> (V, f) , (V, f'n) ->• (V, f) an d fa*, /„) -»• fax, /) 
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where x is an arbitrary mass-partition. Since sup n ||// t || < oo, Lemma 3(ii) enables us to apply 
dominated convergence, and we conclude that (3) holds. 

We now turn our attention to uniqueness; we consider an arbitrary solution r/ to (3) and 
write M' k for the fc-th moment of 77'. We have already observed in the introduction that the first 
moment M[ of 77' can be computed in terms of the function $ defined by (4). More generally, 
specifying (3) for f(x) = x k yields an equation of the form 

<b(k{l-a)-l)M , k = m k {M[ ) ... ) M' k _ 1 ) 

for a certain multinomial function Hence M' k = M k for every k e N, and since we know 
from Lemma 1 that the moment problem for 77 is determined, this concludes the proof. □ 



3.4 Proof of Lemma 2 

We shall finally establish Lemma 2 using the framework of homogeneous fragmentations with 
values in the space of partitions of N. Given a measure v which fulfills (1), we first consider 
a homogeneous fragmentation II = (U(t),t > 0) with dislocation measure v and no erosion. For 
every t > the random partition Il(t) is exchangeable and we write |IL(t)| for the asymptotic 
frequency of the i-th block of II(t). The self-similar fragmentation X is related to II by a sort 
of time-change described in Theorem 3.3 of [5], and if we introduce 



(Ein.(t)M dt, 



then the connexion between II and X implies that A and A have the same distribution. 

It will be convenient to approximate A by Riemann sums. More precisely, for every integer 
k > 1 we define 

k 2 00 



k 

i=\ i=i 

The L 1 distance between these two quantities is easily computed in terms of the function $ 
defined in (4). 

Lemma 4 For every k > 1, we have 

EfU -A,\)= 1 - fc -i l-exp(-A;$(-a)) 
**\) exp($(-a)/ifc) - 1 • 

Proof: The process |IT(-) |^ of the ranked sequence of the asymptotic frequencies of II(-) is 
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a mass- fragmentation; as a consequence t I n« (t ) | 1 a decreases and we have Ak < A. 

Hence E(|.A — Ak\) = E(^4) — E(^4 fc ) and the stated formula now follows from the fact that 

E^ln^)! 1 -^ = exp(-f*(-a)) ; 

see Corollary 2.4(i) and Theorem 3.2 in [5]. □ 

Next, for every integer n, we write IT( n ) for the homogeneous fragmentation with disloca- 
tion measure z/^(dx) = l{ 1 _ xi>1 / n yis(dx) and no erosion. We point at the following weak 
convergence. 

Lemma 5 The sequence of processes (IT^ :nGN) converges in the sense of finite dimensional 
distributions to II as n — > oo. 



Proof: If ip G V® is a partition and k > 1 an integer, then we denote by the restriction 
of p to the set of the k first integers, [&] = {1, . . . , k}. We also endow Vn with the ultra- metric 

d(ip, ip') = 1/ sup{£; > 1 : ip m = ip\ [k] } ; 

cf. Lemma 2.6 in [5]. 

The restriction II| of LI(i) to [A;] is a Markov chain in continuous time, and we have to 
verify that II converges in the sense of finite dimensional distributions to II|[ fc ] as n — > oo, for 
each k. This is equivalent to checking the convergence of the corresponding jump rates of the 
Markov chains. 

For every non-trivial partition 7 of [k], we write 

g 7 = limr 1 P(n| W (t)= 7 ). 

Recall from Theorem 3.1 and Proposition 3.2 in [5] that the jump rate g 7 can be expressed in 
terms of the dislocation measure v as 

? 7 = / £x(7Mdx) 

where x (t) is the distribution of Kingman's paintbox process 7r x based on x. This means 
that we consider a sequence £1, • • • of i.i.d. variables with = i) — xi for i > 1 and 
P(£i = 0) = 1 — x ii an d is the exchangeable random partition which is obtained by 
declaring that two integers i 7^ j are in the same block of 7r x if and only if £j = £j 7^ 0. 
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Writing q^ 1 for the jump rate of 11^ , we thus have 

Qj-q^ l) = / l{i-*i<i/n}£x(7Mdx) 



It is plain from the paintbox construction that for every mass-partition x, the probability 
that the paintbox process based on x yields the trivial partition on [k] is at least x\. Thus 
£?x(t) < 1 — a^i < k{\ — Xi) for every non-trivial partition 7 of [k]. We conclude from (1) that 
Hindoo = g 7 , which establishes our claim. □ 

We are now in shape to prove Lemma 2 

Proof of Lemma 2: The space V m of mass-partitions is a compact metric space when endowed 
with the uniform distance (Proposition 2.1 in [5]) and the map x — > Y^=i x \~ a ls continuous 
and bounded. We write |n(t)|^ for the sequence of the asymptotic frequency of LT(i) ranked in 
the decreasing order. Recall from Proposition 2.9 in [5] that Lemma 5 entails the convergence 
in the sense of finite dimensional distributions of (|L T ( n )(t)|-'', t > 0) towards (|n(i) t > 0). 
Therefore, for every k > 1, we have also in the obvious notation 



lim A k = Ak in law. (5) 

n—>oo 



Let / : M_)_ — > R be a bounded function which is globally Lipschitz-continuous, so by the 
triangle inequality 

\E(f(A)) - E(/CA<">))| < |E(/(A)) - + c f (E(\A - A k \) + E(\A^ - A k n) \)) . 

It follows readily from Lemma 4 that 

lim E( \A {n) - A[ n) I ) = uniformly in n , 

so for every e > 0, we can find an integer k sufficiently large such that 

E(\A- A k \) +E(\A {n) - A[ n) \) < e/2c f for all n, 

and then we use (5) to find an integer n £ such that \E(f(Ak)) — E(f(A k n ^))\ < e/2 whenever 
n > n £ . □ 

Acknowledgments. This work has been supported by ANR-08-BLAN-0220-01. 
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